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We describe a model for special-service circuit activity to assist in forecast- 
ing, provisioning, and "churn" studies. We assume that customers order a 
random number of circuits for an exponentially distributed period of time and 
that the rate of new connect orders grows exponentially with time. These 
assumptions yield simple formulae giving the means and variances of the 
number of active circuits at a future time and the total number of connected 
and disconnected circuits during a future period. Distributions of these vari- 
ables can, in principle, also be computed. There are three important parameters 
characterizing the model: growth rate, disconnect rate, and batchiness; we 
describe their physical meaning and discuss methods to estimate them. This 
document describes the analytical portion of an effort to develop a model 
based on the physics of special-service circuit activity. 

I. INTRODUCTION 

The purpose of this paper is to describe a model for special-service 
circuit activity to assist in forecasting, provisioning, and "churn" 
studies, which can be summarized by a few parameters that have a 
physical interpretation. The calibration and measurement of the fit of 
this model to data in a New Jersey Bell database is being pursued 
simultaneously and will be reported elsewhere. 

The model treated here is derived from a priori consideration of the 
physical behavior of customers. It is based on the assumption that the 
number of active circuits, although growing, is in some sense in 
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equilibrium as well; that is, certain characteristics of the system are 
not changing. This is to be contrasted with a model proposed by Nucho 
in which transient analysis is fundamental. 1 The primary difference 
between these models is that the demand rate for new circuits is a 
function of the number of active circuits in the Nucho model, whereas 
it is considered to be an exogenous variable here. In the Nucho model, 
the variance to mean ratio of the number of active circuits increases 
indefinitely with time (since fluctuations tend to feed on themselves); 
in the model considered here this ratio remains constant. Another 
difference between the models is that the model described here allows 
an order to be for more than one circuit. 

Here, we assume that (1) the arrivals of special-service circuit orders 
are given by a nonhomogeneous Poisson process with exponentially 
growing intensity, (2) each order is for a random number of circuits (a 
batch) with arbitrary distribution, and (3) the lifetime of an order is 
an exponentially distributed random variable, during which time the 
number of held circuits per order remains constant. Note that the last 
assumption implies that an order lifetime and a circuit lifetime have 
the same distribution. 

We use three important parameters in special-services modeling, 
each with its own physical interpretation. These parameters may be 
described as growth rate, disconnect rate (per circuit), and batchiness. 

The growth rate summarizes the rate at which the mean number of 
active circuits increases with time. It may be expressed in terms of 
proportion increase per unit of time; we denote it 0. Thus the mean 
number of circuits at time t is proportional to e 0t . We actually assume 
that connect activity grows at rate 0, but it turns out that the number 
of active circuits, the total connect rate, and the total disconnect rate 
are all proportional to e* in this model. Of course, for small growth 
rates or short periods of time, exponential growth is very close to 
linear growth. 

The disconnect rate, denoted n, is the ratio of the number of 
disconnects per unit time (i.e., the total disconnect rate) to the number 
of active circuits. The mean circuit lifetime is then 1/n. The distribu- 
tions of circuit lifetimes have been shown to be well approximated by 
negative exponential distributions; 2 thus the disconnect rate does not 
vary with the age of a circuit. 

The batchiness of the arrival process is related to the tendency of 
special service circuits to be ordered in multiples greater than one. We 
call the batchiness parameter v and define it to be the ratio of the 
second moment to the first moment of the number of circuits in an 
order. 

The ultimate goal of this modeling process is to provide a tool that 
can be used to predict special-services needs in the future. The model 
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contained herein should be very useful in this regard. One should 
remember that the underlying process is stochastic so that there is a 
fundamental uncertainty even if one has exact specification of the 
parameters of the model. The standard deviation of future require- 
ments can be quite large compared to the mean for small circuit 
groupings, and this presents a major problem for provisioning at the 
most detailed level. This problem cannot be surmounted with a better 
model and/or additional data collection. The present analysis allows 
quantification of the fundamental uncertainty of forecasting, an in- 
sight which is difficult to obtain purely by statistical methods. The 
only possible method to further decrease relative uncertainty is to 
aggregate demand, or to obtain advance knowledge of connect or 
disconnect activity (sometimes called "deterministic events"). 

The rest of the paper is organized as follows: Section II summarizes 
the important results of the paper, giving formulae for the means and 
variances of the number of active circuits in the future, the total 
number of connects in a future interval, and the total number of 
disconnects in a future interval; and giving statistical methods to 
estimate the fundamental parameters of the model such as growth 
rate, disconnect rate, and batchiness. The reader not interested in the 
derivation of these results may stop at this point. 

The predictions (summarized in Section II) of the model are derived 
in Section IV. These derivations are primarily substitutions into 
formulae given in Section III. Section III describes and analyzes a 
much more general model than the one described in this introduction 
(we refer to the latter simply as "the model"). We have chosen to 
introduce this generalized model for two reasons. First, the analysis 
required for the treatment of the generalized model is little different 
in complexity from that required for treatment of the specific model. 
Second, the general results of Section III allow rapid exploration of 
the consequences of changes in assumptions of the model. For example, 
one can explore the effects of linear growth of demand, or the super- 
exponential growth in demand which follows introduction of a new 
service. However, we do feel that the original assumptions are appro- 
priate in most circumstances. Thus, the consequences of this model 
are the only ones summarized in Section II, and it is this specific 
model which is being verified with respect to the New Jersey Bell 
Telephone Co. database. Thus, Section III is provided for reference in 
case of non-typical special service applications. 

Section V derives the statistical methods (summarized in Section 
II) for estimation of the fundamental parameters of the model. Section 
VI is a summary. 

Appendix A gives background information on the compound Poisson 
random variable, and Appendix B gives background information on 
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the non-homogeneous Poisson process. These results are needed in 
Sections III and IV. 

Table I presents values of a function useful in estimating growth 
(see Section II) and Table II lists the notation used in the paper. 

II. SUMMARY OF KEY RESULTS 

This section provides a summary of the important results of the 
paper derived in Sections IV and V. 

2.1 Churn 

Our model depends on three physical parameters: growth rate (/3), 
disconnect rate (n), and batchiness (v). The meaning of these param- 
eters is described in Section I. Another physical parameter is "churn," 
which has been defined in many different ways. For any reasonable 
definition, the churn is determined by the growth and disconnect rates 
of the model. We define the churn to be the minimum of the disconnect 
rate per circuit and the connect rate per circuit, and denote it by 7. 
With this definition, it can be shown [see (75)] that 

7 = min(/j, n + /J). (1) 

The values of churn under other definitions are also readily available. 
For example, if one defines churn to be the ratio of the average total 
connect rate to the average rate of change of net active circuits, then 
this value of churn is (1 - 7)" 1 . Under still another definition, the 
churn equals n/(n + /?). 

2.2 Mean and variance of total active circuits at a future time 

Here we give the mean M(t) and the approximate variance V{t) of 
the number of circuits in service at a given time t in the future. The 
mean and the variance depend on the present (at time t = 0) number 
k of circuits in service, the present instantaneous rate D of circuit 
demand due to new orders, and the three key parameters described 
previously: /?, n, and v. We give these two relationships below: 



M(t) = he-" + -^- ie" - e-"), (2) 

H + P 



and 

V(t) = v 



M + P 



>]■ 



It is interesting that (2) and (3) together imply the relationship 

V(t) = u(M(t) - he- 2 " 1 ), (4) 
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which relates the variance of a forecast to the mean of the forecast, 
the number of circuits currently active, k, and the parameters v and p. 

2.3 Mean and variance of the total number of connected or disconnected 
circuits in the future 

Similar results are available for the mean and variance of the total 
number of connected circuits (variables subscripted with a C) and the 
mean and variance of the total number of disconnected circuits (sub- 
scripted with a D) in an interval of length t beginning immediately: 

M c (t) = ^ (e» - 1), (5) 

V c (t) = v ^ (e* - 1) = uMdt) (6) 

p 

M D (t) = k(l - e-"<) + a/ D °*. e* 
Pin + p) 

D ° r*-a (7) 



M + ^ P' 



and 



VM = » \kr*(l - e-«) + -?f- e« + -2±- e~« - - D ° 

[ p(n + p) n + 

= u(M D (t) - he- 2 " 1 ). (8) 

In this case, the total numbers of connected and disconnected 
circuits are dependent random variables. 

We may also obtain the coefficient of correlation p between the 
number of active circuits at different times 

p[Y(t), Y(t + t)) = e-<M+0/2) T> (9) 

where Y(t) is the number of active circuits at time t. 

2.4 Estimation of the model parameters 

To use results such as (2) through (9), we must be able to estimate 
the parameters p, D ot v, and p. These questions are addressed in 
Section V; we provide a brief summary here. Suppose that the system 
has been observed over the interval [—0, 0] and n connect orders are 
observed at times t u ..., t n . Form the statistic 

'i 
S = I ti/nB + 1, (10) 
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and then the maximum likelihood estimator for the growth rate @ is 

P = I rHS), (ii) 

where / is the function given in (79). Values of / _1 are available in 
Table I. Once $ has been obtained from (11), the estimator D for the 
instantaneous present demand D (assumed to be at the end of the 
interval of observation [-0, 0]) is 

4 - r^n * (12) 

1 — e - " 8 

where N is an estimator for the average number of circuits per order 
and is equal to the average number of circuits actually observed per 
order. The estimator v for the batchiness v of the order size is 

I k 2 i k 
v = *=? , (13) 



I ki 



k=\ 



where i k is the observed number of existing orders of size k. The 
estimator p. for the parameter n can be obtained as the average 
disconnect rate for observed circuits 



M = 



2. (H) 



where m is the total number of disconnects observed, and r is the sum 
of the observed connection times for all circuits; fi can also be obtained 
from estimators of the churn and growth rate through the use of (1). 
Estimation of these parameters from data supplied by New Jersey 
Bell Telephone Co. is being investigated. Estimates of the disconnect 
rate Ji by service family are available in the Reed and Smith paper, 2 
in which it is shown that the lifetimes of special-service circuits are 
well approximated by exponential functions with means dependent on 
the service families. 

III. A GENERALIZED MODEL 

This section treats a model that is more general than that which we 
propose for special-service activity in most cases. The analysis pre- 
sented here will be applied to the specific model in Section IV. 

3. 1 Description of the generalized model 

We examine an arbitrarily defined category of special-service cir- 
cuits (for example, circuits of a particular service family in a given 
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wire center) and divide the active circuits into independent groups. 
Possibly, each group is the demand from a single user, since it is 
reasonable that the activity of one user does not affect another. To 
facilitate this method of thinking we shall refer to the groups as 
"orders." Each order becomes nonzero for the first time at some point 
in time (referred to as the arrival or connect time of the order) and 
then has some history of changing size in some arbitrary manner 
before possibly becoming zero again indefinitely at some time (the 
departure or disconnect time of the order). The length of the interval 
between the arrival and departure of an order will be called the lifetime 
of the order. Obviously, the number of active circuits at any time 
equals the sum of the sizes of the existing orders at that time. 

We assume that there is a large pool of customers (or potential 
orders) so that the arrival of an order has little effect on the potential 
arrival of others. Thus, the arrival of orders can be modeled by a 
nonhomogeneous Poisson process, whose intensity at time t is given 
by some function X(t). For background on this process see Ross 3 or 
Karlin and Taylor. 4 Denote the probability that an arriving order at 
time t is initially of size m as q m (t), and let P£„(£, x) be the probability 
that an order arriving at time t of initial size m as becomes size n at 
time x 5= t. 

3.2 Distribution of the number of active circuits at a given time 

Since the orders are noninterfering it can easily be seen (see Appen- 
dix B) that the number of orders of size n at time x is Poisson 
distributed with mean a n (x), where 



-1/ 



a n (x) = X X(t)q m (t)P* mn (t, x)dt, (15) 

and that the numbers of orders of different sizes at time x are 
independent of each other. If Y(x) is the total number of active special- 
services circuits in the category of interest at time x, then Y(x) has a 
compound Poisson distribution (see Appendix A), and 

E[Y(x)] = l na n (x), (16) 

and 

var[Y(x)] = J n 2 a n (x). (17) 

3.3 Distribution of future active circuits due to present orders 

The transient behavior of this model is easily derived if one has 
knowledge of the distribution of order sizes at a given time. We treat 
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this case first and then consider the more difficult case where only the 
total number of active circuits at a given time is known. In either case, 
we will find the distribution of the number of active circuits at time y 
resulting from the orders observed to be active at time x. The total 
circuits active at time y is the sum of this with the number of circuits 
at time y resulting from orders arriving between x and y. 

Case 1: Order sizes known 

Given an order is of size n at time x, the conditional density that it 
arrived as an order of size m at time t is p m „(t, x), where 

'-«• x) ! ^M (18) 

and if several orders of size n are present at x their arriving times and 
sizes may be considered to be conditionally independent (see Appendix 
B). Thus an order of size n at time x becomes an order of size I at time 
y 2s x with probability rAx, y) where 

r„,(x, y) = otnHx) I I \(t)q m {t)P* mn (t, x)q mnl {t, x, y)dt, (19) 

m=l * / -°° 

and q mn i(t, x, y) is the conditional probability that an order arriving as 
size m at time t which is of size n at time x becomes size / at time y. 
Note that q mnl is not available solely from P*. 

Equation (19) allows us to compute the distribution of the total 
number of circuits at time y that were due to orders observed at time 
x, since all orders behave independently. Evaluating these distributions 
explicitly can be quite difficult. We can, however, easily evaluate the 
moments. Let M„(x, y) be the mean order size at time y for an order 
observed to be size n at time x, and let V n (x, y) be the mean order size 
at time y for an order observed to be size n at time x, and let V n (x, y) 
be the analogously defined variance. Then 

M n (x, y)=l IrAx, y), (20) 



/=i 



V n (x, y) = S l 2 rM, y) - MUx, y). (21) 



/=i 



If i n (x) is the number of orders of size n observed at time x, and 
M(x, y) and V(x, y) denote the mean and variance of the number of 
circuits at time y due to orders observed at time x, then 

M(x, y) = l i n (x)M n (x, y), (22) 
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and 

V(x, y) = I in(x)V n (x, y). (23) 

n 

Note that there is a potential problem if orders can become size zero 
and then become nonzero later, since determination of io, the number 
of active orders of size 0, may be an impossible task. 

Case 2: Order sixes unknown 

We now examine the more difficult case where we observe the total 
number of active circuits at time x (call this k), without observing the 
distribution of the order sizes. The conditional probability that there 
are j x orders of size 1, j 2 orders of size 2, etc., given that k total circuits 
are observed at time x, written dkAJu fa* •••)» is easily found to be 

IL Mxm\] 

8kAju h "" ) " ~~i n kww (24) 

A+3/2+-- * «' 

provided that j t + 2j 2 + • • • = k. Let the conditional first and second 
moment of the number of circuits at time y due to orders observed at 
time x, given that a total of k circuits were observed at time x, be 
M k , x (y) and M {2) kfX (y) respectively. Then 

M k Jy) = 2 EiJtWtx, y), (25) 

where J t is a random variable with the same distribution as the 
conditional number of orders of size i at time x, so that the expectation 
is the expectation with respect to the probability distribution given in 
eq. (24). Also, 

Mgiy) = S E(Ji) V,{x, y) + E((l </,M,(x, y)) 2 ), (26) 

where the expectation is in the same sense as before. Needless to say, 
these expectations with respect to the distribution in (24) are very 
difficult to evaluate for substantial k. 
Things simplify somewhat if 

Mi(x, y) = id(x, y), (27) 

that is, if the conditional means are proportional to the size of the 
order. In this case, (25) and (26) give 

M kA (y) = k0(x,y), (28) 



and 



MZ(y) = I EWViix, y) + k 2 2 (x t y), (29) 
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so that 

V k Ay) = 1 EiJdViix, y), (30) 

where Vijiy) denotes the conditional variance. Equation (30) can be 
approximated by using the following approximation which is intui- 
tively reasonable for k near £ m(x), 

EW.) » k =S£L (31) 

L laAx) 

In this case, (30) and (31) give the following useful approximation: 

VkAy) * k v . 7T • (32) 

3.4 Distribution of future active circuits due to future orders 

In section 3.3, we found the mean and the variance of the number 
of circuits at time y from orders observed at time x. To obtain the 
total number of circuits at time y, we need to add to this the (inde- 
pendent) number of circuits due to orders arriving between time x and 
time y. The number of orders of size n at time y that arrived between 
times x and y is easily seen to be Poisson with mean <x n (x, y), where 



<xn(x, y) = £ 

m=l *** 



\(t)q M (t)P*n(t, y)dt, (33) 



and the number of orders of different sizes are independent of each 
other (see Appendix B). Thus, the number of circuits at time y due to 
arrivals occurring between x and y has a compound Poisson distribu- 
tion (see Appendix A) with mean and variance denoted M*(x, y) and 
V*(x, y), where 

M*(x, y) = l na n (x, y), (34) 

n 

and 

V*(x, y) = l n 2 a n (x, y). (35) 

n 

3.5 Mean and variance of future active circuits 

To find expressions for the mean or variance of the total number of 
active circuits at time y, we merely add together the appropriate means 
or variances from the circuits active at time y due to orders observed 
at time x and from the circuits active at time y due to arrivals between 
x and y, since these are independent. For example, eqs. (28) and (34) 
give 

MUy) = Mix, y) + S„ na n (x, y), (36) 
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where M'kJ.y) is the total mean number of circuits observed at time y 
given k circuits are observed at time x [and assuming relationship 
(27)]. Also, eqs. (32) and (35) give the following approximation: 

wt i \ u Z ai(x)Vi(x, y) , 

VLAy) ~ k . + 2 n 2 a n (x, y), (37) 

1 ia,(x) 

where Vl iX (y) is the similarly defined variance. 

3.6 Churn 

We have previously defined churn as the minimum of the disconnect 
rate per circuit and the connect rate per circuit. Values of churn from 
other definitions are also easily obtained. We will here derive the 
churn, which happens to be a function of time in this case. To compute 
churn we need to know the probability measure for the individual 
order histories. Let U m (t, x) be the expected number of connects 
for an order of size m arriving at time t in the interval [t, x] (thus 
U m (t, t) = m). The expected total connect rate at time x, denoted 
U(x), is then found to be: 



(?£ 



£/(*)=£(X | \(t)q„,(!)UJi,x)dtl (38) 



and similarly for the disconnects using the variable D, 

D( - X) = £ (^ J-cc X <*to«(t)2>«(t, x)dt\ (39) 

and thus we obtain the churn at time x, y(x): 

y(x) = min\D(x)/E[Y(x)], U(x)/E[Y(x)]\, (40) 

where E[ Y(x)] is given by (16). 

IV. THE MODEL FOR SPECIAL-SERVICE CIRCUIT ACTIVITY 

Here, we assume that the demand rate grows exponentially and that 
the behavior of orders is not dependent on the time of arrival. Specif- 
ically, we assume, 

\(t) = A e* (41) 

q m (t) = q m , (42) 

and 

P* mn (t, x) = P mn (x - t). (43) 

Later we will assume a specific form for P mn . 
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Assumptions (41) through (43) are equivalent to: 

1. exponential growth in the rate of new orders at rate p (new orders 
occur as a nonhomogeneous Poisson process), 

2. the probability that a new order is for m circuits is q m , and 

3. an order initially for m circuits requires a total of n circuits after 
2 units of time with probability P mn (z). We will shortly further specify 
P mn to represent unchanging orders of exponential lifetime. 

We now explore the consequences of (41) to (43) in the analysis 
presented in Section III. Substituting into (15) we find that the number 
of orders of size n at time x is Poisson distributed with mean a n (x), 
where 

a n (x) = a„e* (44) 

and 

a n = \olq m I e- fiy P mn (y)dy. (45) 

(The total number of circuits required at any time has a compound 
Poisson distribution, see Appendix A.) Thus, the mean and variance 
of the number of circuits at time x, Y(x), are growing exponentially at 
the same rate, and the ratio remains fixed: 

E[Y(x)} = e &x S na n , (46) 



n=l 



var[Y(*)] = e* x £ n 2 a n , (47) 



n=\ 



or 



var[Y(x)] - vE[Y(x)), (48) 

where 



£ n 2 a n 

V = -55 

2 na n 



^ . (49) 



nmi 



Further results are possible if the behavior for orders over time is 
specified. We assume that the order size does not change over its 
lifetime, which has a common distribution with c.d.f. F independent 
of size. Later we will assume that F is an exponential distribution. 
Although in practice the number of circuits per order does change 
with time, it is conceivable that this movement is relatively unimpor- 
tant; or even if important, that the general form of eqs. (2) and (3) 
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will hold, although the parameter n may then have a different physical 
meaning than we will associate here. In the model suggested here, 
PUy) = F(y)\ P mm (y) = 1 - F(y); P mn (y) = 0, n + 0, n * m. In this 
case we may compute a n more explicitly. Substituting into (45), we 
obtain 

a n = q n \j [1 - F(p)]\ (50) 



where 

e'^dFiy). (51) 



F(P)= f 

Jo 



When the lifetimes are exponentially distributed with mean l/n, i.e. 
F(x) = 1 - e-" x , 

«n = <7nAo(M + P)~ l . (52) 

Also, the batchiness v is related to the order-size distribution; 
substitution into (49) yields 

v = — . (53) 

Inq n 

The assumption that the order size does not change throughout its 
lifetime also allows more explicit representation of the mean and 
variance of the future requirements for circuits. Our development here 
parallels that of Section III. We first compute the probability that an 
observed order will change size during the period of observation. Recall 
that q mn i(t, x, y) is the conditional probability that an order is of size I 
at time y given that it was of size n at time x and arrived as size m at 
time t. We easily obtain: 

,, , F(y - t) 

qmmmit, X, y) = ~=r — , (54) 

F(x - t) 



and 



where 



qmmoit, x, y) = 1 - q mmm (t, x, y), 



F(x) = 1 - F(x), 

and q mm i(t, x, y) = 0, / ¥= 0, / 9* m. The value of q is irrelevant for 
n # m. 

We next find the probability that an order of size n at time x 
becomes of size / at time y, which we denote r n i(x, y). Substitution into 
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(19) gives 



where 



rni(x, y) - 0, n + U n * 0; (55) 

r„ n (x, y) = <7(y - x), (56) 

J e~*F{z + t)dz 
G(t) = -^ ; (57) 

I e- p *F{z)dz 



and 



r„o(*,y) = l-S(y-*). 

Note that in the exponential -lifetime case, where F(z) = e~ M2 , 

G(y) = e~" y . (58) 

We next find the mean and variance of the number of circuits in an 
order at time y, which was observed to be of size n at time x, denoted 
M n (x, y) and V n (x, y), respectively. Substitutions of (55) and (56) into 
(20) and (21) give: 

M n (x, y) = nG(y- x), and (59) 

V n (x, y) = n 2 G(y - x)[l - G(y - x)]. (60) 

Notice that the conditional means are proportional to the size of the 
order, i.e., (59) implies (27). 

We now focus on the mean and variance of the number of circuits 
at time y due to orders which were observed at time x, given that k 
circuits were observed at time x. These quantities are denoted M k , x ( y) 
and V k , x {y) respectively. Equation (28) gives 

M kA (y) = kG(y - x). (61) 

We also conclude that, given the approximation in (32), 

vuv) * *Giy - *>H - tfy - *W l^ 2 ' (62) 

thus 

V k , x m v[l - G(y - x)]M ktX (y), (63) 

where use has been made of (53) and (61). 

We next find the expected number of orders of size n at time y that 
arrived during the interval (x, y) denoted a„(x, y). Use of (3.3) yields 
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a R (x, y) = Ke^qn e~ fiz F(z)dz, (64) 

•fa 

where use is made of the fact P* n (t, ;c) = 9 mm „(t, t, x), which follows 
from (54). When the lifetime distribution for orders is exponential, 
(64) becomes 

otnix, y) = X e^q n ( — 1. (65) 

The mean and variance of all circuits at time y due to orders arriving 
in the interval (x, y), M*(x, y) and V*(x, y), respectively, can be 
obtained by substitution of (64) into (34) and (35) yielding 



ry-x 

M*(x, y) = X l nq n e^ J e~^F{z)dz, 



(66) 



and 

V*(x, y) = pM*(x, y), (67) 

while for exponential lifetimes, 

M*(x, y) = -^2- (e<" - e-"), (68) 

P + M 

where 

D = Ke** 2 nq n , (69) 

and 

t = y — x. 

Note that (61) and (68) [or (36)] give eq. (2), and (63) and (67) [or 
(37)] give (3), since the total number of active circuits at time y is the 
sum of the number of active circuits due to orders present at time x 
and the number of active circuits due to order arrivals between times 
x and y, and these random variables are independent. Equations (5) 
through (9) can easily be derived by the methods described in the 
paper, although we omit the details here. 

Next, turning our attention to churn for the specific model of this 
section, we find that the expected number of connects in the interval 
[t, x] for an order arriving at time t of size m, denoted U m (t, x), is 
given by 

U m (t, x) = m, (70) 

and similarly, 

D m (t, x) = mF(x - t), (71) 
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where the variable D represents disconnects. The total connect rate, 
total disconnect rate, and churn rate at time x, U(x), D(x), and y(x), 
respectively, can be obtained from (38) through (40), yielding 

U(x) = Ace* 1 mm, (72) 

D(X) = X e" x I mq m F(P), (73) 

and 

y(x) = y = 0P(P)/(1 - F(P)), 0^0; 

y( X ) = y = 0/(1 - P(P)), < 0. (74) 

In the special case where lifetimes are exponentially distributed, 

7 = H, P>0'> 

y = p + P, < 0. (75) 

V. ESTIMATION OF THE PARAMETERS OF INTEREST 

In this section, we describe the methodology that can be used to 
estimate the three key parameters of the model; ft the growth rate; n, 
the disconnect rate; and u, the batchiness. 

5. 1 Estimation of 

Suppose that we wish to estimate on the basis of observed arrivals 
of orders, which by assumption occur according to a nonhomogeneous 
Poisson process with intensity \ D e pt . Suppose that the system is 
observed over the interval [-0, 0] and arrivals have been noted at 
times h, ..., t n . We show how to obtain the maximum-likelihood 
estimator for 0. (For a discussion of maximum-likelihood estimation, 
see any elementary book on statistics such as Mood & Graybill.) 6 The 
log-likelihood function, In L(n, h t n ), is easily seen to be 

/l - e^\ 
In L(n, t u ..., t n ) - n lnX + J U - K I— — J . (76) 

Differentiating with respect to X and we find the necessary 
conditions for a maximum: 

£,_^ + X .(i^£!) = 0. (78) 

Using (77) to eliminate X in (78), we obtain 

x(e x - 1) 
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where 

S = ^ + l (80) 

and 

x = 00. (81) 

The function / defined in (79) can be seen to be strictly monotonic 
with range between and 1. Therefore, eq. (79) allows us to solve for 
/30 as f~\S), where S is the statistic defined in (80) equal to the 
proportion of the interval (after — 0) at which the average time of 
arrival occurs. Thus, the maximum-likelihood estimator for /?, written 
J3, is given by 

*-qp. (82, 

The function / has the properties: 

/(-oo) = 0,/(0) = l/2,/(oo) = l, and /(*) + f(-x) - 1. 
Thus, 

/"'(O) = -oo, 

r r u/2) = o, 

ni) = ». 

and 

r\\/2 - X ) = -rni/2 + x). 

The function f~ l is tabulated in Table I. 
For small x, f(x) may readily be expanded in the power series: 

f(x) = l/2h + (l/6)x-^x 3 ...] 

so that 

rHl/2 + y) = 12y + 28.8y 3 - • • . (83) 

Similarly, a large / expansion yields 

f-\\ - 1/y) * y - y 2 e~ y . (84) 

We may also determine the mean and variance of the statistic S 
given the correct parameter /3 and the number of observed arrivals. It 
is well known that the distribution of the arrival times for a nonho- 
mogeneous Poisson process, conditioned on a given number of arrivals 
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Table I — Values of the function f" 1 useful in 

estimating the growth rate 0, and several 

approximations for the function 

[see eq. (79) and following] 









Approxima- 


Approxima- 


X 


f-\X) 


12X-6 


tion in (83) 


tion in (84) 


0.50 


0.0000 


o.oobo 


0.0000 




0.52 


0.2402 


0.2400 


0.2402 




0.54 


0.4819 


0.4800 


0.4818 




0.56 


0.7263 


0.7200 


0.7262 




0.58 


0.9751 


0.9600 


0.9747 




0.60 


1.2299 


1.2000 


1.2288 




0.62 


1.4926 


1.4400 


1.4898 




0.64 


1.7654 


1.6800 


1.7590 




0.66 


2.0507 


1.9200 


2.0380 




0.68 


2.3517 




2.3280 




0.70 


2.6721 




2.6308 




0.72 


3.0168 




2.9467 




0.74 


3.3920 








0.76 


3.8060 








0.78 


4.2703 








0.80 


4.8010 






4.8316 


0.82 


5.4219 






5.4362 


0.84 


6.1691 






6.1746 


0.86 


7.1010 






7.1025 


0.88 


8.3164 






8.3166 


0.90 


9.9954 






9.9955 


0.92 


12.4994 






12.4994 


0.94 


16.6667 






16.6667 


0.96 


25.0000 






25.0000 


0.98 


50.0000 






50.0000 


1.00 


00 






00 



in the interval, is the same as the order statistics from n i.i.d. random 
variables with probability density proportional to the arrival rate. 
Thus S has the distribution of the average of n i.i.d. random variables, 
7, on [0, 1] with density g{p), where 



xe f 
g(p) = 



e x - 1 

and 

x = 0®. 
It is easily seen that 

E(Y)=f(x) (85) 

and 

var(y) = ^- _, , I, „ . (86) 



X 



e x + e' x - 2 



Equation (86) is valid if x * 0; when x = 0, var( Y) = 1/12, the limit 
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of (86) as x goes to 0. The expression for the variance is symmetric in 
x and takes its maximum value at x = 0. 

Thus, for a given value of the growth rate /3, and a (large) given 
number of observations n, the observed statistic S is approximately 
normally distributed with mean f(x) and variance less than or equal 
to l/12n. This observation can be readily translated into confidence 
intervals through the use of elementary statistical theory. For example, 
a 95-percent confidence interval foi f(x) (assuming normality of the 
statistic) is 

5 " L96 Vrk^w^ 1 - 96 VX (87) 



which translates to 



f' 1 { s ~ 1M Vs) * " Q 




r\S+ 1.96 \/jg|. (88) 



If S is close to 0.5, then we can use / '(*) * 12x — 6 [see (83)] to 
obtain for the 95 -percent confidence interval for 00 

6 79 
j80 = 12S - 6 ± -^ . (89) 

y/n 

5.2 Estimation of v 

There are several possible statistics for the measurement of the 
batchiness v. We shall take as our starting point eq. (49) which defines 
the batchiness in terms of the distribution of the order size at (any) 
point in time. This is preferable and is more robust than using the 
distribution of the order size on arrival, although the two happen to 
equal when order sizes do not change with time. Thus, if v is to be 
estimated based on observation of the system at a given point in time 
at which i n orders of size n are observed, then a reasonable estimator 
for v, which we write v, is: 

*-££. (90) 

2 ni n 

When the number of circuits does change during the lifetime of an 
order, it is possible that the form of (3) still holds. In this case, it is 
likely that the parameter v which multiplies each of the two terms in 
(3) is different. Equation (90) is a reasonable estimator for the multi- 
plier of the second term. The multiplier of the first term should be 
estimated by other methods. 
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Table II— Notation 



D Present (t = 0) circuit demand rate due to new orders. 

D(x) Expected total disconnect rate at time x. 

D m (t, x) Expected number of disconnects in the interval [t, x] for an order 

of size m arriving at time t. 
i n (x) Observed number of orders of size n at time x. 

j t Conditional number of orders of size i due to orders observed at 

time x given that k circuits were observed at time x. 
M(x, y ) Mean total number of circuits at time y due to orders. 

M*(x, y ) Mean of the total number of circuits at time y due to orders arriving 

between x and y. 
Ml^y) Mean of the number of circuits at time y given k are observed at 

time x. 
MhAy) Conditional expectation of the number of circuits at time v due to 

orders observed at time x given that k circuits are observed at time 

x. 
Mi 2 J(y ) Conditional second moment of the number of circuits at time y due 

to orders observed. 
M„(x, y) Mean order size at time y for an order observed to be of size n at 

time x. 
P^t) Probability that an order of initial size m becomes of size n, t time 

units after arrival. 
Pmn(t, x) Probability that an order arriving at time t of initial size m becomes 

of size n at time x. 
q m q m (t) when there is no dependence on t. 

q m (t) Probability that an order at time t is initially for m circuits. 

qmni(t, x, y) Conditional probability that an order arriving as size m at time t 

and of size n at time x becomes of size / at time y. 
rnl ( x , y) Probability that an order of size n at time x becomes an order of 

size / at time y. 
U(x) Expected total connect rate at time x. 

U„(t, x) Expected number of connects for an order of size m arriving at time 

t in the interval [t, x]. 
V(x, y) Variance of the total number of circuits at time y due to orders 

observed active at time x. 
V*(x, y) Variance of the total number of circuits at time y due to orders 

arriving between x and y. 
Vj x ( y ) Variance of the number of circuits at time y given that k are observed 

at time x. 
V M (y) Conditional variance of the total number of circuits at time y due 

to orders observed at x given that k circuits are observed at time x. 
V n (x, y) Variance of order size at time y for an order observed to be of size 

n at time x. 
Y(x) Total number of active special service circuits at time x. 

a n Constant of proportionality for the exponential growth of a n {x). 

a „(x) Mean number of orders of size n at time x. 

a n (x,y) The number of orders of size n at time y which arrived between 

times x and y. 
f) Growth rate. 

7 Churn. 

fifcxO'i, hi ■ ■ •) Conditional probability that there are j\ orders of size 1, j 2 orders 

of size 2, etc., at time x given that k total circuits are observed at 

time x. 
0(x, y) Defined in (27). 

Length of observation period. 

X,, Present (t = 0) arrival rate of orders. 

X(r) Instantaneous arrival rate of orders at time t. 

p Disconnect rate (per circuit). 

v Batchiness. 

Pmn (t, x) The conditional probability density that an order arrived at time t 

of initial size m given that it is of size n at time x. 



5.3 Estimation of n 

The estimation of n is relatively straightforward. The maximum- 
likelihood estimator is given in (14) and further details including 
estimated values by service family are given in Reed and Smith. 2 

5.4 Estimation of D 

Equation (77) allows the MLE estimator of X , 

X ° = rr^e-> (91) 

where the estimator $ has been previously described in (82). The 
estimator of D (the instantaneous demand at the end of the obser- 
vation interval of length 0), D then is 

&o = KN, (92) 

where N is an estimate of the average batch size. The previous 
expectation can be estimated from the order sizes at arrival epochs, or 
more crudely from the general distribution of order sizes at an arbitrary 
point of time. 

Interestingly enough, D /(fi + /?) can be estimated solely from the 
number of active circuits at a point of time. For simplicity, we assume 
that the orders are solely of size one, although the analysis could be 
repeated for other distributions. In this case, the following analysis is 
applicable. 

Suppose that the number of active circuits at time £ is a Poisson 

random variable with mean \e m . The time that the mean is between 

x and x + dx is dx/x/3. The expected time that the mean is between x 

dx x 

and x + dx and a total of k active circuits are observed is e~ x . 

xp k! 

Thus, if k active circuits are observed, the conditional distribution 

of the mean (in our case this is D /(n + (3)) has density proportional 

to x k ~ l e~ x , or is a standard gamma random variable with k degrees 

of freedom. This random variable has mean and variance equal 

to k. Thus, if k circuits are observed, the conditional distribution of 

DJ(n + /3) has mean k and variance k, and is approximately normally 

distributed if k is large. This information can be used to modify eqs. 

(2) and (3), to take into account the variability ofD /(n + /S) to obtain: 

M k (t) = ke 0t , (93) 

and 

V k (t) = vk[e ot - e" 2 "'] + k[e* - e-") 2 , (94) 

where the subscript k on the variables on the left-hand side indicates 
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conditional means and variances knowing 0, /*, and v but not knowing 
D . Note that the variance-to-mean ratio is unbounded for increasing 
t, since errors in estimation of D accumulate indefinitely. 

VI. SUMMARY 

We have described a model for special-services activity useful in 
forecasting special-services requirements. It requires three physical 
characterizations of the process (growth rate, disconnect rate, and 
batchiness) and two instantaneous measurements (the current number 
of active circuits and the instantaneous rate of new connects). We 
give means and variances for the numbers of active circuits at a given 
point in the future and for the total number of connects or disconnects 
during a future period. The distribution of these variables can be 
computed by the methodology described in the paper. We also describe 
general techniques for estimation of the required parameters. 

Work is being undertaken to verify and calibrate the model with 
the New Jersey Bell Telephone Co. database and will be reported 
elsewhere. 
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APPENDIX A 

The Compound Poisson Random Variable 

Briefly, a random variable is said to be a compound Poisson random 
variable if it can be thought of as the sum of a Poisson number of 
independent identically distributed positive integer-valued random 
variables. Thus Y is a compound Poisson random variable if 

N 

Y= IXu 

j=i 

where N is a Poisson random variable with mean a, N and the X, are 
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independent and 

P\X i = n\=p n . 

We have rather easily 

E(Y) = aE(X) = alnp n 

Var(Y) = aE(X 2 ) = a £n 2 p„. 

An alternative (and equivalent) way of specifying a compound Poisson 
random variable is 

Y = i iZ h 

i=i 

where 

Zt are independent Poisson random variables with E(Zi) = ap, = a,. 
In this case it is convenient to think of Z, as the number of batches or 
orders of size n that are aggregated to give the total number denoted 
Y. 

APPENDIX B 

The Nonhomogeneous Poisson Process 

A process which counts events is a nonhomogeneous Poisson process 
(see, for example, Ross, Ref. 5) with intensity \(t) ^ if the number 
of events in the interval [x, y] is a Poisson random variable with mean 
j y x \(t)dt, and the number of events in disjoint intervals are independ- 
ent. 

Fact: If events from a nonhomogeneous Poisson process are of two 
types, and an event at time t is of Type 1 with probability p(t), then 
the process which counts Type 1 events is a nonhomogeneous Poisson 
process with intensity \{t)p(t), and it is independent of the counting 
process which counts Type 2 events [which is a nonhomogeneous 
Poisson process intensity A(£)(l ~ pit)]. 

Fact: Suppose that a nonhomogeneous Poisson process is observed 
over the interval [x, y] and n events are observed. If the times of these 
events are arranged in random order, their distribution is identical to 
that of n independent identically distributed random variables whose 
density at t is 

X(t) 



I 



y 

\{z)dZ 



if t G [x, y] and is zero otherwise. 
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